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Abstract 

Random walks on expander graphs were thoroughly studied, with the important motivation 
that, under some natural conditions, these walks mix quickly and provide an efficient method 
of sampling the vertices of a graph. The authors of [2] studied non-backtracking random walks 
on regular graphs, and showed that their mixing rate may be up to twice as fast as that of the 
simple random walk. As an application, they showed that the maximal number of visits to a 
vertex, made by a non-backtracking random walk of length n on a high-girth n-vertex regular 
expander, is typically (1 -I- o(l)) i^°f^„ , as in the case of the balls and bins experiment. They 
further asked whether one can establish the precise distribution of the visits such a walk makes. 

In this work, we answer the above question by combining a generalized form of Brun's sieve 
with some extensions of the ideas in [2] . Let Nt denote the number of vertices visited precisely 
t times by a non-backtracking random walk of length n on a regular n-vertex expander of 
fixed degree and girth g. We prove that if 5 = ^^(1), then for any fixed t, Nt/n is typically 
^ -|- 0(1). Furthermore, if g = f2(loglogn), then Nt/n is typically "'"^"p^ uniformly on all 
t < {I - o(l)) iog°iogn s-n^ ^ > (1 + 0(1)) log log « ■ particular, we obtain the above 

result on the typical maximal number of visits to a single vertex, with an improved threshold 
window. The essence of the proof lies in showing that variables counting the number of visits 
to a set of sufficiently distant vertices are asymptotically independent Poisson variables. 



1 Introduction 



1.1 Background and definitions 



Random walks on graphs have played a major role in Theoretical and Applied Computer Science, as 
under some natural requirements (related to the notion of expander graphs) , these walks converge 
quickly to a unique stationary distribution, and enable efficient sampling of this distribution. This 
fact was exploited for example in [1] , [5] and [13] , in the study of space efficient algorithms for 5 — T 
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connectivity in undirected graphs. Another well known example is the conservation of random bits 
in the amplification of randomized algorithms (we will elaborate on this point later on). In many 
applications for random walks, it seems that using non-backtracking random walks may yield 
better results, and that these walks possess better random-looking properties than those of simple 
random walks. This motivated the authors of [2j to study the mixing-rate of a non-backtracking 
random walk, and show that it may be up to twice as fast as that of a simple random walk. They 
further show that the number of times that such a walk visits the vertices of a high-girth expander is 
random-looking, in the sense that its maximum is typically asymptotically the same as the maximal 
load of the classical balls and bins experiment. In this paper, we further examine this setting, and 
answer a question raised in p| by giving a precise description of the limiting distribution of these 
visits. 

We briefly mention some well known properties of random walks on regular graphs; for further 
information, see, e.g., [Tl], [Hj. Let G = iV^E) denote a d-regular undirected graph on n vertices. 
A random walk of length k on G from a given vertex wq is a uniformly chosen member W S W, 
where W = {{wq^wi, . . . ,Wk) ■ Wi^iWi € E} is the set of all paths of length k starting from wq. 
Alternatively, a random walk on G, 7W, is a Markov chain whose state space is V, where the 
transition probability from u io v \s Puv = 'i-uveE/d. The transition probability matrix of M is 
doubly stochastic, and the uniform distribution 7r{u) = 1/n is a stationary distribution of A^. If G 
is connected and non-bipartite then A4 is irreducible and aperiodic, in which case it converges to 
the unique stationary distribution vr, regardless of the starting point wq. These two sufficient and 
necessary conditions have a clear formulation in terms of the spectrum of G, which also dictates 
the rate at which M converges to vr. 

The adjacency matrix of G is symmetric and thus has n real eigenvalues, all at most d in 
absolute value (by the Perron- Frobenius Theorem). Let d = Ai > A2 > . . . > A„ denote these 
eigenvalues. It is simple and well known (see, e.g., [8J) that the multiplicity of the eigenvalue d is 
equal to the number of connected components of G, and the minimal eigenvalue An is equal to —d 
iff G has a bipartite connected component. Therefore, letting A = maxj>i |Aj| denote the maximal 
absolute value of all non-trivial eigenvalues, we obtain that G is connected and non-bipartite iff 
A < d. The quantity d — X is often referred to as the spectral gap of G, and is strongly related 
to the expansion properties of the graph. In particular, when d — A is bounded from below by a 
constant, we call the graph an expander (a closely related notion of expander graphs is defined by 
the expansion ratio of each set of at most n/2 vertices to its neighbor vertices in the graph). See 
|10j for a survey on the many fascinating pseudo-random properties exhibited by such graphs. 

An (n, d, A) graph is a d-regular graph on n vertices, whose largest non-trivial eigenvalue in 
absolute value is A. As mentioned above, the condition A < d is sufficient and necessary for 
the random walk on G to converge to vr. In this case, the quantity X/d governs the rate of this 
convergence: the mixing rate of Ai is defined to be limsupjrj^oQ \Puv^ — 7r{v)\^^^, where Puv^ is the 
probability that M reaches v in the k-th step given that it started from u. It is well known (see, 
for instance, [llj) that the mixing rate of the simple random walk on an (n, d, A) graph is X/d, and 
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that in fact, the L2 distance between tt and the distribution of M after k steps is at most (X/d)^. 
Therefore, after ^{log^^^^n) steps, the L2 distance between the distribution of the simple random 
walk and the uniform distribution is at most l/n^^^\ 

A useful and well known application of random walks on expander graphs is the following result, 
related to the conservation of random bits (cf., e.g., [1], Corollary 9.28). Suppose G is an (n,d. A) 
graph, and C/ is a predefined set of an vertices, for some a > 0. Then a random walk of length 
k, starting from a random vertex, avoids U with probability at most (1 — a + d^)^- Indeed, the 
term (1 — a)'' is the probability to miss U when selecting k vertices, uniformly and independently, 
in which case we would require k log n random bits for the selection process. Using a random walk, 
we require only log n + k log d random bits, at the cost of increasing the base of the exponent by an 
additive term of a^. This enables amplifying the error-probability in randomized algorithms (such 
as the Rabin-Miller primality testing algorithm) using fewer random bits: an algorithm utilizing 
s-bit seeds can be amplified k times using s + Q{k) random bits via a walk on a constant-degree 
expander, instead of sk random bits in the naive approach. 

In many applications of random walks on graphs, forbidding the random walk to backtrack 
appears to produce better results; an example of this is the construction of sparse universal graphs 
in [3], where a crucial element is a non-backtracking random walk on a high-girth expander. A 
non-backtracking random walk of length k on G, starting from some vertex wq, is a sequence 
W = (wq, . . . ,Wk), where Wi is chosen uniformly over all neighbors of Wi-i excluding ■Wi-2- The 
mixing-rate of a non-backtracking random walk on a regular graph, in terms of its eigenvalues, was 
computed in [2], using some properties of Chebyshev polynomials of the second kind. It is shown 
in [2] that this rate is always better than that of the simple random walk, provided that d = n°^-^\ 
In fact, the mixing rate of the non-backtracking random walk may be up to twice faster, and the 
closer the graph is to being a Ramanujan graph (that is, a graph satisfying A < 2\/d — 1), the 
closer the ratio between the two mixing-rates is to 2{d — l)/d. 

As an application, the authors of [2] analyzed the maximal number of visits that a non- 
backtracking random walk of length n makes to a vertex of G, an (n, d, A) graph of fixed degree 
and girth 17 (log log n). Using a careful second moment argument, they proved that this quantity is 
typically (1 -|- ^^°f^^ , as is the typical maximal number of balls in a single bin when throwing 
n balls to n bins uniformly at random (more information on the classical balls and bins experiment 
may be found in [7J) . In contrast to this, it is easy to see that a typical simple random walk 
of length n on a graph as above visits some vertex r2(logn) times. The authors of [2j further 
asked whether it is possible to establish the precise distribution of the number of visits that a 
non-backtracking random walk makes on a graph G as above. 

In this paper, we answer the above question, by combining a generalized form of Brun's Sieve 
with extensions of some of the ideas in [2]. This approach shows that even if the girth of G grows 
to infinity arbitrarily slowly with n, then for any fixed t, the fraction of vertices visited precisely 
t times is typically (1 + o(l))n/(et!), where the o(l)-term tends to as n — > 00. The extension of 
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Brun's Sieve, which includes an estimate on the convergence rate of the variables, is used to treat 
the case where t depends on n, and allows the characterization of the number of vertices visited t 
times for all t. In particular, this provides an alternative proof for the result of [2] of the typical 
maximum number of visits to a vertex, with an improved error term. These results are summarized 
below. 

Throughout the paper, all logarithms are in the natural basis, and an event, defined for every 
n, is said to occur with high probability or almost surely if its probability tends to 1 as n ^ oo. 

1.2 Results for expanders with a non-fixed girth 

A simple argument shows that the requirement for a non-fixed girth is essentially necessary if one 
wishes that the number of visits at vertices will exhibit a Poisson distribution. Indeed, if G is a 
graph where every vertex is contained in a cycle of a fixed length, the probability that the walk 
W will traverse a cycle t consecutive times, becomes much larger than the Poisson probability of t 
visits to a vertex for a sufficiently large t. Simple random walks correspond to the case of cycles of 
length 2. 

On the other hand, this requirement on the girth turns out to be sufficient as-well: as long as 
the girth of an (n, d, A) graph G tends to infinity arbitrarily slowly with n, the number of visits that 
a non-backtracking random walk makes to vertices exhibits a Poisson distribution, as the following 
theorem states: 

Theorem 1.1. For fixed d > 3 and fixed X < d, let G be an (n, d. A) graph whose girth is at least 
g = Let W = (ifQ, . . . ,Wn) denote a non-backtracking random walk of length n on G from 

wq, and Nt the number of vertices which W visits precisely t times: 

Nt = \{v eV{G) : \{l <i <n:w, = v}\ =t}\ . (1) 

Then for every fixed t, Nt/n = l/(et!) -|- o(l) almost surely, the o{l)-term tending to as n ^ oo. 

As we later mention, the above theorem in fact holds even for non-fixed A, d (as long as the 
spectral gap is large compared to d/g). The essence of the proof of Theorem II. II lies in proving that 
the variables counting the visits at vertices, whose pairwise distances are large, are asymptotically 
independent Poisson variables: 

Proposition 1.2. Let G be a graph as in Theorem \l.l\ For some fixed r and fJ- > 0, let vi, . . . ,Vr 

denote vertices of G whose pairwise distances are at least g. Let W be a non-backtracking random 
walk of length m = fin on G starting from vi , and Xi be the number of visits that W makes to Vi . 
Then {Xi, . . . , X^) {Zi, . . . , Zr) as n ^ oo, where the Zi-s are i.i.d. Poisson random variables 
with means fi, Zi ~ Po(/i). 

Remark 1.3: The statement of Proposition 11.21 holds (with the same proof) even if we replace 
the requirement on the girth of G with the weaker assumption, that vi, . . . ,Vr are not contained in 
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a closed nontrivial walk of length smaller than g. In this case, the presence of other possibly short 
cycles in G has no effect on the limiting distribution of (Xi, . . . , Xr). 

Remark 1.4: The parameters A and d in Theorem 1 1 . 1 1 and in Proposition 11.21 need not be fixed, 
as long as the spectral gap, d — X, is uj{d/g) (where g = uj{1) was the lower bound on the girth 
of G). For the sake of simplicity, we prove the case of fixed d, A, and later describe the required 
adjustments for the general case. 



1.3 Stronger results for high-girth expanders 

Letting Nf continue to denote the number of vertices visited t times by a non-backtracking random 
walk of length n on G, as in ([1]), consider the case where t is no longer fixed. In case we want 
to extend the result of Theorem 11.11 for values of t which depend on n, and approximate Nt by 
n/(ei!) uniformly over all t, a behavior analogous to the balls and bins model, we need to assume 
a larger girth. As noted in [2], there are d-regular expander graphs with girth g, where a typical 
non-backtracking random walk visits some vertex ^}{logn/ g) times. Therefore, the girth should be 
at least Q(loglogn) to allow the number of visits to exhibit a Poisson distribution for all t. 

Indeed, an r2(loglogn) girth suffices in order to approximate the above number of visits uni- 
formly over all values of t up to the asymptotically maximal number of visits: 

Theorem 1.5. Let G be as in Theorem li.il If g > lOlog^.^logn then for every fixed 6 > 0, the 
following holds with high probability: 



N. J 



n/{et\) 



< -r-rn for all t<F(l-S) , 

— log log log n J \ / ' 



Nt = for all t> F{l + 5) , 

where {Nt} are the variables defined in and F{x) = (l + x ^°fj°fj°g" ) \^^- 

The above theorem is also related to the notion of conserving random bits, which was mentioned 
earlier (where the goal was to simulate a distribution which had an exponentially small probability 
of avoiding a set, using a linear number of random bits). Indeed, using 0(n) random bits, it is 
possible to simulate a distribution which resembles the resulting distribution of throwing n balls 
to n bins uniformly and independently, as opposed to the naive approach, which requires nlogn 
random bits. 

Theorem 1 1 . 5 1 also immediately gives the result of [2] regarding the maximal number of visits that 
a non-backtracking random walk makes to a single vertex, with an improved threshold window, 
replacing the o ( i^gf^g^ ) error term by o ( ffoSog°«/°^ ) • 

Corollary 1.6. For any fixed d > 3 and fixed X < d the following holds: if G is an (n, d, A) graph 
whose girth is larger than 101og^_^ logn, then the maximal number of visits to a single vertex made 
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by a non-backtracking random walk of length n on G is with high probability 



( 



1 + (1 + 0(1)) 



log log log n 



log log n 



) 



log log n 



log n 



where the o{l)-term tends to as n ^ oo. 

The proof of Theorem 11.51 follows from a result analogous to Proposition 11.21 however, since we 
require an estimate on the rate of convergence to Poisson variables, we require an extended form of 
a multivariate Brun's Sieve (Proposition 13. 1( ). The proof may be applied to a more general setting, 
where the set of visited vertices has size depending on n, and the walk is of length uj{n). However, 
for the sake of simplicity, we work in the setting of Theorem 11.51 that is, a fixed set of vertices and 
a walk of length Q(n), as stated in the following proposition: 

Proposition 1.7. Let G be a graph as in Theorem \l.l\ For some fixed r and fj- > 0, let vi, . . . ,Vr 

denote vertices of G whose pairwise distances are at least g. Let W be a non-backtracking random 
walk of length m = fin on G starting from vi , and Xi be the number of visits that W makes to Vi . 
If 9 ^ clog^_x logn for some fixed c > 6 then 



where Z ~ Po(/i) and the o{l)-term tends to as n ^ oo. 

Remark 1.8: As in the case of Proposition 11.21 (see Remark ll.Sp . the requirement on the girth 
in Proposition 11.71 may be replaced with the assumption that {vi, . . . ,Vr} are not contained in a 
closed nontrivial walk of length smaller than g. 

I. 4 Organization 

The rest of the paper is organized as follows: in Section[2]we prove Theorem 1 1 . 1 1 and Proposition [L2] 
concerning expanders with a non- fixed girth. In Section Owe formulate and prove the multivariate 
version of Brun's Sieve which specifies the rate of convergence to the limiting distribution. This 
version is subsequently used in Section[l]to prove the above mentioned Theorem 1 1.5 1 and Proposition 

II. 7[ The final section. Section [SJ is devoted to concluding remarks and some open problems. 

2 A Poisson approximation for expanders with a non-fixed girth 
2.1 Proof of Proposition [TT2] 

For the simpler goal of proving Poisson convergence without estimating its rate, we will need the 
following known results. The well known univariate version of Brun's Sieve states the following: 



Pr[nLiX,=t,] 
nUPr[Z = t,] 



1 <0((logn)^) for allti,...,tr e {O,l,...,[logn\} 
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Theorem 2.1 (Brun's Sieve). Let X = X{n) he a sum of indicator variables, and let fi > 0. If for 

every r, lim„^oo ^C^) = Z^'"/''!; then X Z as n ^ oo, where Z ~ Po(//). 

See, e.g., [3] (pp. 119-122) for the derivation of this result from the Inclusion-Exclusion Principle 
and the Bonferroni inequalities [6], as well as for several applications. A multivariate version of 
Brun's Sieve is stated in [S], and a proof of the multivariate version by induction (using Brun's 
Sieve once for the base of the induction, and once more for the induction step) appears in |15j : 

Theorem 2.2 (Multivariate Brun's Sieve). Let Xi = X(n), . . . ,Xr = X,.(n) denote sums of indi- 
cator variables, and let /Ui, . . . , fir > 0- If for every ti, . . . ,tr, lim„^oo E[ni=i ( ^')] ~ 111=1 /^iV*«^ 
then {Xi, . . . , Xr) (^i, . . . , Zr), where the Zi-s are independent Poisson variables, Zi ~ Po(;Uj). 

While a stronger version of Brun's Sieve is proved in Section [3l (Proposition [XT]) . Theorem 12.21 
suffices for the proof of Proposition 11.21 where the rate of convergence to the Poisson distribution 
is not specified. Indeed, letting Xi, . . . ,Xr and fi be as in Proposition II. 2[ we need to prove that 

„ll-':in(t)l = nf for alU... (3) 

i=l ^ * ^ 1 = 1 * 

Fix integers ti,...,tr, and set t = X^-^j^ti. Let {vi,wi, . . . ,Wm) denote the path of the non- 
backtracking random walk W, and for all i £ [r] and j E [m] let Xy denote the indicator for the 
event that W visits Vi in position j; that is, Xij = l^^^^y.y, and by definition, Xi = YlJLi ^ij- It 
follows that: 



E 



[^(t;)]^5:'/'[nnA•«^l 



(4) 



where ranges over Ii,... ,1^ C [m] with = ti. We will rewrite the right-hand-side of the 
above equation. To this end, set 

L = (logn)2, (5) 

and let g = u;(l) be a lower bound for the girth of G, which satisfies g = o{L) (such a g exists 
by the assumption on G). For all s G {0, . . . , t = denote the collection of r-tuples 

(/i, . . . , If) where: 

• Ii, . . . ,Ir are disjoint subsets of [m] and \Ij\ = tj for all j. 

• There are precisely s consecutive elements of Uj/j U {0} whose distance is less than L. 
In other words: 

T- f.r 7- ^ \J = {xi,X2,..., xt} C [m], xo = 0, \Ij\=tj ioT all j, \ 

ls = < (Jl, . . . ,irj : , , Jin/-/. n I f ' 

I Xi-i < Xi for all I, and \{l < i < t : Xi — Xi-i < L\\ = s J 

Notice that the events Xij = 1 and Xiij = 1 are disjoint for i ^ %' . Therefore, ^ takes the following 
form: 



4n(f')]=^ ^ prinn^«=ii- m 

i=l^*^ s=0 (/i,...,7,,)eX« i6[r]i6/. 
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The fohowing claim estimates the probabihty that a non-backtracking random walk, starting from 
some given Vi, would end up in some given vj after less than L steps, as well as after some given 

~(k) 

k > L number of steps. Here and in what follows, the notation P^v denotes the probability that a 
non-backtracking random walk of length k, which starts in u, ends in v. 

Claim 2.3. Let G be as above, and define: M = maxjjg[j.] ^^^^ PifJ^ . Then M = o(l) and 
PviVj = ^^'^^'^ for all k > L and i,j G [r], where in both cases the o{\)-term tends to as n ^ oo. 

Proof. We need a few results on the mixing of non-backtracking random walks, proved in [2j- The 
mixing-rate of a non-backtracking random walk on G is defined as: 

p{G)= lim sup max \p(f^) - -\'/' . (8) 
Theorem 1.1 of \2\ determines the value of p as a function of A and d: 

^=^7^'"^"^^^^"^ = \ 1 IfO<x<l. 

As shown in [2], one can verify that p < max{^, -^i^}, and in our case, as A and d are both fixed, 
so is < /9 < 1. Furthermore, by the proof of the above theorem, 

max|pW-i|<(l + o(l))p\ (10) 

where the o(l)-term tends to as /c ^ oo, and is independent of n. In particular, by the choice of 
L to be (logn)^, for every sufficiently large n we have 



P^J;) = for all A; > L and ah u, v. 

n 



(11) 



Take some i,j € [r] (not necessarily distinct). By the assumption on the pairwise distances of 
vi, . . . ,Vj. and the girth of G, we have PviVj = for all k < g. On the other hand, (jlOp and the fact 
that g = Lo{l) imply that Pit^ < ;^ + (1 + o(l))/0^ for all k > g, giving the upper bound: 

L-l L~l 

P 



The required result now follows from the fact that L = o(n), p is fixed and g = uj(1). ■ 

For convenience, when examining some element (/i, . . . , Ir) G^s, we use the following notation: 
denote hy ii, ... ,is £ [m] the s indices of the Xi-s which satisfy \xi — Xi-i\ < L, as in ([6]), the 
definition of Zg- In addition, for every i G [m], let v{xi) denote the vertex vj, where j G [r] is the 
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single index satisfying Xi G Ij. The following holds: 

(Ii,...,lr)els jG[r]ie/i 



< 



ti,...,trj\t- sj\sj \ n J 



• • • ,ir-) G - s) ( n ) ^ ■ ^^'^^ 

Letting ^(s) denote the right hand side of (fT2]) . it follows that for all s < t: 



e(s) (m-t + s + l)(s + l) l + o(l) 

We deduce that 



M = e(M) = o(l) 



■[n(t)]=E E p.inn^...=ii 



r 

<EC(^)<(l + o(lMO) = (l + o(l] 

s=0 

,t 



rn\ / 1 + o(l) 



= (l + o(l))=^. (13) 

For the other direction, consider J, the collection of all r-tuples of disjoint subsets of [m] \ [L], 
{Ii, . . . , Ir), where \Ij\ = tj and the pairwise distances of the indices all exceed L: 

j^) . Ujlj = {xi,X2,---,xt} C {L + l,...,m}, 1 ^^^^ 
1 \Ij\ = tj for all j and Xi > + L for all i j 

Since C Us^:^ and PifJ^. = for ah A; > L and i,j e [r] (Claim ES]), we get: 



4n(t)]s E 



l-o(l)V / t \(m-Lt\(l-o{l) 



n J \ti,...,trj\ t J \ n 



= (l + o(l))=^. (15) 

Inequalities (I13p and ()15p imply that ([3]) holds, completing the proof of Proposition ll.2[ ■ 

Remark 2.4: The assumption that G is an (n, d, A) graph for some fixed d > 3 and fixed A was 
exploited solely in Claim [231 In fact, the proof holds whenever for some L = o(n) and g = uj{1), 
g < L, the girth of G is at least g and = o(l). Suppose that d > 3 but X,d are no longer 
fixed. Recalling that p < max{^, the requirements of Proposition 11.21 may be replaced, for 

instance, with G being an (n, d, A) graph of girth larger than g = uj{l), where d — X = Lo{d/g). 
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2.2 Proof of Theorem [TT] 



To prove the theorem, we use the estimates given by Proposition 11.21 for the cases r = 1,2, and 
apply a simple second moment argument. The assumptions of the theorem imply that for any two 
vertices u,v €zV, whose distance, as well as their distance from wq, are all at least g, we have: 



Pr[X„ = = ^ + o(l) , 

FiiXu = X^ = t]= Fr[Xu = + o(l) for every fixed t , 



(16) 
(17) 



where the two o(l)-terms tend to as n — > oo. Let g = g{n) be such that the girth of G is at least 
g, and in addition, g = o(logn). Let Nt denote the number of vertices which W visits precisely t 
times; we wish to obtain an estimate on the probability that A'^t = (1 + o(l))n/(et!). As t is fixed, 
the effect of any o{n) positions along W have on this value is negligible, and we may ignore the set 
of vertices whose distance from wq is less than g. Therefore, let U denote the set of vertices whose 
distance from wq is at least g, and define 



■t] 



Since \U\ > n — d{d — 1)^ ^ = (1 — o(l))n, we have: 

\Nt - N[\ 



n/{et\) 

and thus, showing that iV/ = (1 + o(l))?7-/(et!) almost surely will complete the proof. By ([16 



(18) 



n 



Eiv; = (i + o(i))- 



and denoting by 5{u,v) the distance between two vertices u,v, we deduce the following from (jl7p : 
VariNl) <ENl+Y,Y.{ ^'^^^ = X, = t] - Pr[X„ = t] Pr[X, = t]' 



5(u,v)<g 

< (1 + o(n)) ENl + o(n^) = o{n^) 



(19) 



Chebyshev's inequality now gives: 



Pr 



n 



n{n) = O (Var(iV;)/n2 



oil) 



completing the proof of the theorem. 
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3 Multivariate Brun's Sieve with an estimated rate of convergence 



Recall that the versions of Brun's Sieve stated in Section [2l (Theorem 12.11 and Theorem 12. 2p do not 
specify the rate of convergence to the Poisson distribution, and furthermore, the inductive proof 
of the multivariate case (which appears in [15]) gives an undesirable extra dependence of the rate 
of convergence on the number of variables. We therefore prove the next version of Brun's Sieve, 
which follows directly from a multivariate version of the Bonferroni inequalities: 

Proposition 3.1. Let Ai = {Aij : j £ [Mi]}, i E [r], denote r classes of events, and denote by 
Xi = 'i-Aij the number of events in Ai which occur. Suppose that for some integer T and 

some choice of e, s, fii, . . . , fir > satisfying s > /j, and 2^ < e < (2re^)~^, where jj. = maxj 
we have: 

mu (?)] 



< e for all ii, . . . , G {0, 1, . . . , r{T + 2s)} 



Then: 



< e' for all ti, . . . , e {0, . . . , T} 



nUPr[Z,=t,] 

where e' = 2 exp(2 /Xj)e + y/e and Zi, . . . ,Zr are i.i.d., Zi ~ Vo{fii). 



(20) 



(21) 



Proof. We need the following known multivariate generalization of the Bonferroni inequalities: 
Theorem 3.2 ([12j). Let Ai = {Aij : j S [-A^j]}, i £ [r], denote r classes of events, and let 



Xi = 'i-Aij denote the number of events in Ai which occur. Define: 



j ^ hC[Ml] IrC[Mr] j = lkelj 



:[Ml] IrC[Mr 
\h\=il \Ir\=ir 



The following holds for all non-negative integers mi, . . . , rur, < mj < Mj, and k >0: 
A{2k + 1) < PvlHiXi = rui] < A{2k) , where: 



(22) 



(23) 



As the function f{x,k) = X^f=o^V^' satisfies \ f{x,k) — e^'| < 2^^ for all x with |x| < ^y^, the 
assumption on s implies that 



I 

X 



1=0 



< e for all A; > 2s — 1 and |x| < ^ 



(24) 



Let mi, . . . , nir E {0, . . . , T}, and set M = Y2i ^-i^d 



p = FrlDiZi = mi] = JJ 

i=l 



r m. 



mil 



(25) 
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Notice that r(T + 2s) < minjMj, otherwise we would get the contradiction e > I from (j20p . 
According to the notations of Theorem I3.2[ by (|20|) and the facts M = ^^[^^ rrii and rrij < T: 

(l-e)n^ - •^^'''■■■''''^ < (l+e)n^ ii,...,ir- G {0,... ,M + 2rs} . 

j •'' i •'' 

Therefore, the following holds: 

M+k , r / • N N 

t=M Y.h=t i=i 

M+k / r / • \ , ' 

Pi 



r M+k , . ^ i] 

n E m*-^' 

j=lij=mj ^ 
M+fc M+k 

Y Yl ^j:ij>M+k 

il=mi ir=mr 



*J-"^j 1 _)_ £ 



A/+fc 



n E 



n 



.1)(E 



ij)-M 



+ e 



(26) 



Let Ei,E2,E3 denote the final three expressions in (j26p . that is, A(A;) < Ei + E2 — E^. A similar 
calculation gives A(A;) > Ei — E2 — E^ (with room to spare, as we could have replaced E^ by a 
smaller expression by replacing e by — e). We therefore wish to provide bounds on Ei, E2, E^. For 
all > 2s — 1 we have: 



1 



El 
P 



r M+k 



1 



HE 



rrijj ij\ 



j=l j^=mj 

vaj AI —ni j+k 



i--n^ E 

j=l /=o 



< (1 + ee^)"^ - 1 < e^/2 _ 1 < (27) 
< ee'^, and the second follows from 



where the first inequality is by (p^ . as |l — e'^-' X]f=o ^^~Tr~ 
the assumption that e < (2re'^)~^. Similarly, for A; > 2s — 1: 

r A/+fc . . V ij r rrij 

^^=-n E (i)7Tfi-n'='"^;b=«=">(^Ef.)f ■ 



(28) 



For the bound on l-Bs], recall that M = "^iTrii, and hence, if ^jij > M + 2rs we must have 
it ^ + 2s for some t. Therefore, for all k > 2rs — 1: 

M+k M+k , r / ■ \ ,,*J \ 

E-EiE.>««(nfl)&) 

i\=m\ ir=mr 

M+k M+k 

< + Yl ••• lEi.>M+fc 



n 



rrij/ ij[ 



r M+k 

<(i+-)E E 

t=l it=r?it+2s 



^,,tine-^)<-xp(2i:..). 



(29) 
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where the last inequahty is by the fact ^ < e/2, which imphes that 



M+k 



mt\ (2s)! 



Altogether, combining ()27|) . (I28p and ()29p we get the following for /c > 2rs — 1: 

m 



1 



< + 2 exp (2 ^ = e' . 



The proof is completed by the fact that A(2rs) < Pr[niXj = mj] < A(2rs — 1). ■ 

4 A Poisson approximation for high-girth expanders 

In this section, we prove Proposition 11.71 and its corollary, Theorem 11.51 which are analogous to 
Proposition 11.21 and Theorem 11.11 but also provide an estimate on the rate of convergence to the 
limiting distributions. This is imperative when looking at vertices which are visited t times, for t 
tending to (X) with n. The proof of Proposition [LTl follows the ideas of the proof of Proposition [L2l 
where instead of the simple version of Brun's Sieve, we use Proposition 13.11 proved in Section [3l 



4.1 Proof of Proposition [TT71 

Recall that g > clog^_]^ logn for some fixed c > 6. We need the following definitions: 

< —75- for all M, t> S y and /c > t 



r = 


min < 
* I 


p{k) _ 


1 

n 


T = 


[log n 






h = 


(log n) 


3-f . 





Recalling (jlOp . for k = r2(logn) we have P, 



(fc) 1 



< n' 



-n(i) 



giving 



r = 0(log n) . 



(30) 

(31) 
(32) 

(33) 



According to the notation of Proposition 13. H set fii = fi for all i, let h play the role of e, and define 
h' to be the analogue of e': 

6-c 



h' = 2e2''^/i + Vh = {l + o(l))(log n)^ 
It follows from Proposition 13.11 that, in order to show that 

< 0{h') for all h, . . . ,tr e {0, . . . ,T} , 



(34) 



it suffices to show that for some s satisfying s > fj, and 2^ < h < (2re'^) ^ we have: 

< 0{h) for all ti,...,^^ € {0, 1, . . . , r(r + 2s)} . 



E[ni=i(t)] . 



(35) 
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Substituting s = T, the requirements T > fi and h < (2re'^) ^ immediately hold for a sufficiently 

T 

large n, as T = /i = o(l) and both /i and r are fixed. The requirement < h holds as well, 
since = 6xp(— (1 — o(l))(logn)(loglogn)), and for a sufficiently large n, this term is clearly 
smaller than h = exp (— O(loglogn)). Therefore, proving (j35p for s = T would complete the proof 
of the proposition, that is, we need to show that 



E[ni=i(t)] 1 



Xi 

'< 0(/i) for alHi,...,tr G {0,1,..., SrT} . (36) 



Let ti, . . . ,tr G {0, . . . , 3rT}, and set 

r 

t = ^ti< Sr'^T = 0(log n) 



i=l 



Let (fi, wi, . . . , Wm) denote the path of the non-backtracking random walk W, and as before, for 
all i £ [r] and j £ [m] let Xij denote the indicator for the event that W visits Vi in position j: 



m 

Y 



As in the proof of Proposition 11.21 we next define the collection T^, this time for L = t = O(logn). 
For all s G {0, . . . , t = ^ ij}, let Xg denote the collection of r-tuples (/i, . . . , I^) of disjoint subsets 
of [m], \Ij\ = tj, so that there are precisely s consecutive elements of Djij U {0} whose distance is 
less than r: 

T- f.r T\ [jjlj = {xi^X2,- ■■ ,xt} C [m], xo = 0, \Ij\=tj ior all j, \ 

I Xi-i < Xi tor ail I, and ||1 < x < i : Xj — Xi-i < r|| = s J 

The facts that the events Xij = 1 and Xi/j = 1 are disjoint for i ^ i' implies that 

*=i ^ * ^ (7i,...,/r)eJ^ ie[r]ie/i 
By definition (|30p . for all Vi, Vj and k > t, Pv^Vj < + n^^. The following claim estimates the 

~(k) 

sum of the probabilities PviVj over all k < t. 

Claim 4.1. Let G be as above, and define M = maxjjg[r] Yl,k<T ^'"^^j ■ Then M = O ^(logn)^~ 

Proof. Let Vi and Vj denote two (not necessarily distinct) elements of {f i, . . . , Vr}- By the assump- 
tion on the pairwise distances of vi, . . . ,Vr and the girth of G, PviVj = for all k < g. It remains 
to estimate Ylk=l Pvvvj ■ 

Set £ = [^^J , and consider U , the set of vertices of G whose distance from Vj is at most I. 
Since the girth of G is at least the induced subgraph of G on [/ is isomorphic to a d-regular tree. 
Next, examine a non-backtracking walk of length k > £ from utov; crucially, since the walk cannot 
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backtrack, the last £ vertices along the walk must form a path from a leaf of the above mentioned 
tree, up to its root. In each of the I steps along this path there is a probability of 1 — to stray 
from the path, hence < {d— Altogether, 

T-l 

r- g 



k=g 



as required. 



Letting ii,. . . ^ig G [m] denote the s indices of the Xj-s which satisfy \xi — < r in the 

definition ([37|) of T^, and defining v{xi) = vj, where j is such that Xi £ Ij, we have: 

(/i,...,/r)ex. ie[r]ie-fi 



< 



/ ^ ' ■ ■ / > 11 iiC^i i) 



^ • • • J y \t' — s J \s / \ n 



<| « ^ ' i±!l_, M-. (39) 



Let ^(s) denote the right hand side of (p9|) . Recalling that m = 0(n) and t = O(logn), the 
following holds for all s < t: 

= 7 .^ • ■ M = 0{t^M) = O ((logn)3-f ) = , 

where the last equality is by the fact that c > 6. Combining this with the fact that, as t = n"^^\ 
(1 + = 1 + 0(n-i+°(i)) = 1 + o{h), we get: 



< (l + 0((logn)3-f))-/l-<(l + 0(/i)): 



and: 



i=i ^ ^5=0 (/i,...,/,)ex, ie[r]jeu i=i 



EiM(:')i<>: E pr[fuix,, = i]<(i+o(/^))ii^. (40) 

' {h,...,Ir)(^Is 

For the other direction, consider J', the collection of all r-tuples of disjoint subsets of [m] \ [r], 
{Ii, . . . , Ir), where \Ij\ = tj and the pairwise distances of the indices all exceed r: 

^ jfr r^ [jjlj = {xuX2,...,xt} C {t + l,...,m}, \ 

I Mil = J ^^'^ ^i+i > 2:?, + T for all i I 



By the definition ^Qij of r, and the fact that (1 + n^^)* = 1 + 0(n~i+°(i)) = 1 + o(/i). 



i=i ^ ^ {h,...,ir)&J ie[r]jeh {h,...,ir)ej 

t \ fm-Tt\ fl-n~^y (l-o(/i))/u* 



ti,...,trj \ t J \ n J ni=i* 



(42) 
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Combining (j40p and (j42p . we obtain that (j36|) holds for all < ti,. . . ,tr < 3rT, completing the 
proof. ■ 



4.2 Proof of Theorem [13] 



The proof will follow from Proposition 11.71 using a second moment argument, in a manner analogous 
to Theorem ll.il Let g = 101og^_]^ logn; by the assumption on G, Proposition 11.71 implies that for 
any two vertices u,v £V, whose distance, as well as their distance from wq, are all at least g, we 
have: 



Pr[X„ = t] 



1 



l/(et!) 
Pr[X, = X, = t] 



O 



1 



l/(et!)2 

Let t be some integer satisfying 



logn 
= O 



for all t < logn , 



1 



logn 



for all t < log n . 



(43) 
(44) 

^ ^ . ^ ^ . _ (45) 

log logn / log logn 

and let Nt denote the number of vertices which W visits precisely t times. We wish to obtain an 
estimate on the probability that A'^t = (1 + o(l))n/(et!). The above choice of t implies that: 



, , logloglogn\ logn 

r < 1 + C — ; ; I ; ; fOr SOmC C < 1 



^ ^ ffi logloglogn* / ^l-o(l) 

— r > exp (1 — c — o(l)) log n— — = exp (1 — c)(logni ^ ' 

etl \ log logn 



(46) 



Hence, the effect of any (log n)*-^*^^) positions along W on this value is negligible, and we may ignore 
the set of vertices whose distance from wq is less than g. Therefore, let U denote the set of vertices 
whose distance from wq is at least g, let Xu {u £ U) denote the number of visits which W makes 
to u, and let Nj. = X^^g^ l{x„=t}- According to this definition, we have: 

\Nt-Nl\ 



n/(et!) «-Pi-(l-^)a°g- 
and it remains to determine the behavior of N^. By (|43p . 

Pv[Xu = tY 



(47) 



n/{etl) 

and we deduce from (H6]) that 



1 



E 



n/{et\) 



0(1/ logn) , 



n 



ENl = (1 - o(l))-^ = n (exp ((1 - c)(logn)i-°(i) 
Furthermore, denoting by 5{u, v) the distance between two vertices u, v, the following holds: 
Var(iV;) < EiV; + = X,=t]- Pr[X, = t] Fi[X, = t]] 



< 



-ueU veil 

S{u,v)<g 



ueu veil 

S{u,v)>g 



< (l + {lognfW)ENl + Ol^4^ 



o 



log n 



(48) 
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Set h = h{n) = log log log n. Applying Chebyshev's inequality gives the following: 



Pr 



n/{etl) 



1 

> - 
- h 



< Pr 



1 71 

\N!-EN!\ > — • — 
' * " - 2h etl 



O 



log n 



and summing this probability for all t in the range specified in (|45p (containing 0{ y^°f^^ ) values) 
we deduce that with high probability: 



N't 



n/{et\) 



1 



< 



1 



for alH < 1 + c 



log log log n \ log n 



log log n / log log n 



(49) 



log log log n 

Recalling the relation between Nl,Nt in (j47p . we obtain that when replacing by Nt, (|49p holds 
as well. It remains to show that with high probability, Nf = for all t > Iq, where 



to 



log log log n\ logn 

i + C — ; ; ; ; for SOmC C > 1 



log logn J log logn 



(50) 



Let u be a vertex of G, and let denote the number of visits that W makes to u. Consider W' , 
a non-backtracking random walk of length n on G starting from u. Proposition 11.71 (for the case of 
one variable vi = u) implies that: 

Pr[X = t] = for all t < logn , 

etl 

where X'^ counts the number of visits that W' makes to u. Clearly, the probability that X^ > to is 
bounded from above by the probability that > to (as we can always condition on the first visit 
to u). Therefore: 



Pr[X„ > to] < PrK > to] = 1 - J] Pr[X = /] < 



Kto 



eto! 



We deduce that the expected number of vertices with more than to visits satisfies: 



n 



E\{u eV:X^> toll < (1 + o{l))— = exp 

eto! 



(l-c)(logn)i-°W) =o(l) 



This completes the proof of the theorem. 



5 Concluding remarks and open problems 

• We have shown that the distribution of the number of visits at vertices made by a non- 
backtracking random walk of length n on G, a regular n-vertex expander of fixed degree 
and large girth, tends to a Poisson distribution with mean 1. Furthermore, if the girth is 
r2(loglogn) we prove the following concentration result: with high probability, the number 
of vertices visited t times is (1 + o(l))^ uniformly over all t < (1 — ^^°f^^ , and for all 
t > (1 + o(l))ij^^^f^ (in fact, the threshold window we get is sharper by a factor of °^giog^" )- 
In particular, we obtain an alternative proof for the typical maximal number of visits to a 
vertex in the above walk, and (slightly) improve upon the estimate of this maximum in [2]. 
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• The above result implies that the distribution of the visits at vertices made by a non- 
backtracking random walk of length n on an n- vertex expander of high girth is asymptotically 
the same as the result of throwing n balls to n bins independently and uniformly at random. 

• The main tool in the proof is an extended version of Brun's Sieve, which includes error esti- 
mates and may be of independent interest. Combining this result with some additional ideas, 
we show that the variables counting the number of visits to vertices, which are sufficiently 
distant apart, are asymptotically independent Poisson variables. This implies the required 
result on the overall distribution of the number of visits at vertices. 

• Theorem 11.51 characterizes the distribution of visits at vertices in non-backtracking random 
walk on a high-girth regular expander. For such a graph on n vertices, the values Nt/n 
converge to (1 + o(l))/(et!), where Nt is the number of vertices visited precisely t times in a 
walk of length n as above. Moreover, we show that the above convergence of {^} is uniform 
over all values of t up to roughly y^^f^ ^ , after which Nt is almost surely 0. 

It seems interesting to investigate this distribution, (^, . . . , as a parameter of general 
vertex transitive graphs, and determine it for additional families of such graphs. 

• Corollary 11.61 determines that the maximum number of visits to a vertex, made by a typical 
non-backtracking random walk of length n on a high-girth n- vertex regular expander, is 
(1 -|- o(l)) jo^^g„ (with an improved error term compared to the results of [2]). 

For which other families of d-regular graphs, with d > 3, is this maximum ©(j^l^f;^)? 

• The "random setting" , where n balls are thrown to n bins, uniformly at random, results in a 
maximal load of (1 -|- o(l)) jo^j^gy^ ; it seems plausible that this bound is the smallest maximal 
load possible for a non-backtracking walk on any regular graph of degree at least 3. Is it 
indeed true that for any n-vertex d-regular graph with d > 3, a non-backtracking random 
walk of length n visits some vertex at least (1 -|- o(l)) io^iog„ times almost surely? 

Acknowledgement We would like to thank Itai Benjamini for useful discussions. 
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